Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

Karvonen, Adam, Wright, Benjamin, Rager, Can, Angell, Rico, Brinkmann, Jannik, Smith, Logan, Verdun, Claudio Mayrink, Bau, David, Marks, Samuel

arXiv.org Artificial Intelligence

What latent features are encoded in language model (LM) representations? Recent work on training sparse autoencoders (SAEs) to disentangle interpretable features in LM representations has shown significant promise. However, evaluating the quality of these SAEs is difficult because we lack a ground-truth collection of interpretable features that we expect good SAEs to recover. We thus propose to measure progress in interpretable dictionary learning by working in the setting of LMs trained on chess and Othello transcripts. These settings carry natural collections of interpretable features -- for example, "there is a knight on F3" -- which we leverage into *supervised* metrics for SAE quality. To guide progress in interpretable dictionary learning, we introduce a new SAE training technique, *p-annealing*, which improves performance on prior unsupervised metrics as well as our new metrics.
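The abstract above describes training SAEs with a sparsity penalty whose exponent p is annealed during training. A minimal sketch of the idea in NumPy, assuming a ReLU autoencoder, a mean-squared reconstruction term, a linear p schedule, and a penalty weight `lam` -- all of which are illustrative choices, not the paper's actual hyperparameters:

```python
import numpy as np

def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """One forward pass of a ReLU sparse autoencoder (shapes are illustrative)."""
    f = np.maximum(0.0, x @ W_enc + b_enc)   # latent feature activations
    x_hat = f @ W_dec + b_dec                # reconstruction of the input
    return f, x_hat

def p_annealed_loss(x, x_hat, f, p, lam=1e-3):
    """Reconstruction error plus an L_p sparsity penalty on the features.

    p-annealing lowers p during training (e.g. from p=1 toward p->0), so the
    penalty approaches an L0 count of active features while remaining smooth
    early in training.
    """
    recon = np.mean((x - x_hat) ** 2)
    sparsity = np.mean(np.sum(np.abs(f) ** p, axis=-1))
    return recon + lam * sparsity

def p_schedule(step, total_steps, p_start=1.0, p_end=0.2):
    """Linear anneal of p over training; the schedule shape is an assumption."""
    t = step / max(total_steps - 1, 1)
    return p_start + t * (p_end - p_start)
```

In practice the loss would be minimized with a gradient-based optimizer while `p_schedule` is queried each step; this sketch only shows the annealed objective itself.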


Can Neural Networks Understand Logical Entailment?

@machinelearnbot



Many Bills: Visualizing the Anatomy of Congressional Legislation

Aktolga, Elif (University of Massachusetts Amherst) | Ros, Irene (IBM Watson Research Center) | Assogba, Yannick (IBM Watson Research Center) | DiMicco, Joan (IBM Watson Research Center)

AAAI Conferences

US Federal Legislation is a common subject of discussion and advocacy on the web. The contents of bills present a significant challenge to both experts and average citizens due to their length and complex legal language. To make bills more accessible to the general public, we present Many Bills: a web-based visualization prototype that reveals the underlying semantics of a bill. We classify the sections of a bill into topics and visualize them using different colors. Further, using information retrieval techniques, we locate sections that do not seem to fit the overall topic of the bill. To highlight outliers in our "misfit mode", we visualize them in red, contrasting with the remaining gray sections. Both the topic and misfit visualizations provide overview and detail views of bills, enabling users to read individual sections of a bill and compare topic patterns across multiple bills. We obtained initial user feedback and continue to collect label corrections from users through the interface.
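The abstract describes flagging "misfit" sections via information retrieval techniques. One way such outlier detection could work -- a sketch, not the authors' actual method -- is to compare each section's bag-of-words vector against the rest of the bill and flag sections whose cosine similarity falls below a threshold (the threshold rule and tokenization here are assumptions):

```python
import math
from collections import Counter

def bow(text):
    """Bag-of-words vector for one bill section (whitespace tokenization)."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def misfit_sections(sections, threshold=0.1):
    """Flag sections dissimilar to the rest of the bill.

    A section is a "misfit" if its cosine similarity to the combined word
    counts of all other sections falls below the (assumed) threshold.
    """
    vecs = [bow(s) for s in sections]
    flags = []
    for i, v in enumerate(vecs):
        rest = Counter()
        for j, w in enumerate(vecs):
            if j != i:
                rest.update(w)
        flags.append(cosine(v, rest) < threshold)
    return flags
```

For example, in a bill whose sections mostly concern tax credits, a stray section about naming a post office would share few terms with the rest and be flagged red in the prototype's misfit view.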